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RENILLA LUCIFERASE AND GREEN FLUORESCENT PROTEIN FUSION GENES 
CROSS-REFERENCE TO RELATED APPLICATIONS 

The present Application is a International Application corresonding to United 
States Patent Application 08/771,850, filed December 23, 1996, entitled "The Construction 
and Expression of Renilla Luciferase and Green Fluorescent Protein Fusion Genes"; and is a 
Continuation-in-Part of United States Provisional Patent Application 60/027,657, filed October 
4, 1996, entitled "The Construction and Expression of Renilla Luciferase and Green 
Fluorescent Fusion Genes in E. coli and Mammalian Cells," the contents of which are 
incorporated herein by reference in their entirety. 

BACKGROUND 

Green Fluorescent Protein (GFP) is a light emitting protein purified from the 
jellyfish Aequorea victoria. GFP can emit green light by accepting energy transfer from 
sources that include exogenous blue light and Renilla luciferase catalyzed reactions. The gene 
for GFP was cloned and its cDNA is a powerful reporter gene in a variety of living systems, 
including bacteria, fungi, and mammalian tissues. The UV light stimulated GFP fluorescence 
does not require cofactors and the gene product alone can be sufficient to allow detection of 
living cells under the light microscope. 

By modifying the wild type GFP protein, red-shifted GFP variants with bright 
emission have also been produced. These variants include EGFP, GFPS65T and RSGF. 
Recently, GFP was expressed in a human cell line and in vivo. C. Kaether, H.H. Gerdes. 
Visualization of protein transport along the secretory pathway using green fluorescent protein. 
FEBS-Lett. 1995; 369:267-71. "Humanized" GFP was synthesized with nucleotide changes 
that did not change the amino acid sequences with one exception. 

Renilla luciferase is an enzyme purified from Renilla reniformis. The enzyme 
catalyzes the oxidative decarboxylation of coelenterazine in the presence of oxygen to produce 
blue light with an emission wavelength maximum of 478 nm. In Renilla reniformis cells, 
however, this reaction is shifted toward the green with a wavelength maximum of 510 nm due 
to an energy transfer to a Green Fluorescent Protein. 

The gene for Renilla luciferase (rue) was cloned and its cDNA was shown to be 
useful as a reporter gene in various living systems. D.C. Prasher, V.K. Eckenrode, W.W. 
Ward, F.G. Prendergast, M J. Cormier. Primary structure of the Aequorea victoria green- 
fluorescent protein. Gene 1992; 111:229-33. By providing appropriate promoters to the 
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cDNA as gene cassettes, the gene was expressed in bacteria, transformed plant cells, and 
mammalian cells. The high efficiency of Renilla luciferase is a useful trait as a marker 
enzyme for gene expression studies. 

Given the properties of GFP and Renilla luciferase, it would be useful to have a 
5 single protein combining the functions of both Renilla luciferase enzymes and GFP to monitor 
gene expression quantitatively by UV light excitation or qualitatively by enzyme activity 
measurements. 

SUMMARY 

According to one embodiment of the present invention, there are provided 
10 fusion gene constructs comprising the cDNA of Renilla luciferase and the cDNA of the 

"humanized" Aequorea green fluorescent protein. The fusion gene constructs were used to 
transform both prokaryotic and eukaryotic cells. One construct was expressed as a polypeptide 
having a molecular weight of about 65 kDa. This polypeptide, the "Renilla-GFP fusion 
protein," was Afunctional, displaying both the luciferase activity of Renilla luciferase and the 
15 green fluorescence of GFP. The Renilla-GFP fusion gene is useful as a double marker for 
monitoring gene expression in living cells and quantitatively by enzymatic activity. 

The invention includes a protein comprising a polypeptide having both 
luciferase and GFP activities, or biologically active variants of a polypeptide having both 
luciferase and GFP, or a protein recognized by a monoclonal antibody having affinity to the 
20 polypeptide having both luciferase and GFP activities. The polypeptide can be made by 
recombinant DNA methods. 

The invention further includes a high affinity monoclonal antibody that 
immunoreacts with the polypeptide. The antibody can have an Fc portion selected from the 
group consisting of the IgM class, the IgG class and the IgA class. The invention also 
25 includes a high affinity monoclonal antibody that immunoreacts with a polypeptide having both 
luciferase and GFP activities. 

The invention further includes a polynucleotide sequence coding for a 
polypeptide having both luciferase and GFP activities, or its complementary strands, and a 
polynucleotide sequence that hybridizes to such a sequence and that codes on expression for a 
30 polypeptide having both luciferase and GFP activities, or its complementary strands. 

The invention further includes a purified and isolated DNA molecule 
comprising a polynucleotide coding for a polypeptide having both luciferase and GFP 
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activities, or its complementary strands. The polynucleotide can comprise the sequence as set 
forth in SEQ ID NO:l. 

The invention further includes a vector containing a DNA molecule coding for a 
polypeptide having both luciferase and GFP activities. The polynucleotide can comprise the 

5 sequence as set forth in SEQ ID NO: 1. The vector can be used to stably transform or 
transiently transfect a host cell. 

The invention further includes a method of making a polypeptide having both 
luciferase and GFP activities. The method comprises the steps of, first, culturing a 
microorganism transformed with a polynucleotide vector containing a gene cassette coding for 

10 a polypeptide having both luciferase and GFP activities. Next, the polypeptide having both 
luciferase and GFP activities is recovered. 

The invention further includes a method of quantifying promoter activations and 
GFP fluorescence based on luciferase activity measurements. The method comprises the step 
of providing the polypeptide according to the present invention. 

15 The invention further includes a method of making a monoclonal antibody that 

immunoreacts with a polypeptide having both luciferase and GFP activities. The method 
comprises the steps of, first, administering to a host a polypeptide having both luciferase and 
GFP activities in an amount sufficient to induce the production of antibodies to the polypeptide 
from the host's antibody-producing cells. Next, the antibody -producing cells are recovered 

20 from the host. Then, cell hybrids are formed by fusing the antibody-producing cell to cells 
capable of substantially unlimited reproduction. Then, the hybrids are cultured. Next, the 
monoclonal antibodies are collected as a product of the hybrids. 

The invention further includes a method of monitoring gene expression 
quantitatively and qualitatively in a cell using a gene fusion construct coding for a polypeptide 

25 having both luciferase and GFP activities. The method comprises the steps of, first, providing 
a gene fusion construct coding for a polypeptide having both Renilla luciferase and GFP 
activity. Next, the gene fusion construct is introduced into the cell. Then, the cell containing 
the gene fusion construct is maintained in a manner allowing the cell to express the 
polypeptide. Then, the cell is measured for luciferase and fluorescent activity. The construct 

30 can include a polynucleotide sequence as set forth in SEQ ID NO:l. 

The invention further includes a method of monitoring gene expression 
quantitatively and qualitatively in a cell using a gene fusion construct coding for a polypeptide 
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having both luciferase and GFP activities. The method comprises the steps of, first, providing 
a gene fusion construct coding for a polypeptide having both luciferase and GFP activities. 
Next, the gene fusion construct is introduced into the cell. Then, the cell containing the gene 
fusion construct is maintained in a manner allowing the cell to express the polypeptide. Next, 
the cell is measured for luciferase and fluorescent activity. 

FIGURES 

These and other features, aspects and advantages of the present invention will 
become better understood with regard to the following description, appended claims, and 
accompanying figures where: 

Figure 1 is a schematic diagram showing the construction of a Renilla luciferase 
and "humanized" GFP fusion gene cassette according to the present invention for gene 
expression in E. coli where "RG," top, is the fusion gene cassette with the Renilla luciferase 
coding sequence (rue) at the 5' terminus, and «GR,» bottom, is the fusion gene cassette with 
the GFP coding sequence (gfp h ) at the 5' terminus; 

Figure 2 is a schematic diagram showing the construction of Renilla luciferase 
and "humanized" GFP fusion gene cassette according to the present invention for gene 
expression in mammalian cells where "RG," top, is the fusion gene cassette with the Renilla 
luciferase coding sequence (rue) at the 5' terminus, and "GR," bottom, is the fusion gene 
cassette with the GFP coding sequence (gfp h ) at the 5' terminus; 

Figure 3 is a map of the plasmids used for cloning and expression of the RG 
gene construct in E. coli (top) and the GR gene construct in E. coli (bottom); 

Figure 4 is a map of the plasmids used for cloning and expression of the RG 
gene construct in mammalian systems (top) and the GR gene construct in mammalian systems 
(bottom); 

Figure 5 are photomicrographs of cells transformed by the fusion genes using 
fluorescence microscopy and fluorescence imaging to show GFP activity; 

Figure 6 are bar graphs of luciferase activity of the fusion gene constructs in E. 
coli (top) and mammalian cells (bottom); 

Figure 7 is a spectroscopic measurement of Renilla luciferase activity and GFP 
30 activity in E. coli; 

Figure 8 is a Western blot showing the detection of fusion gene expression in E. 
coli using ami-Renilla luciferase antibody; 
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Figure 9 are photomicrographs of mouse embryonic stem cells using 
fluorescence image analysis demonstrating the expression of the RG fusion gene; and 

Figure 10 are photomicrographs of mouse embryos using fluorescence image 
analysis demonstrating the expression of the RG fusion gene. 

DESCRIPTION 

According to one embodiment of the present invention, there is provided a 
fusion gene comprising the cDNA of Renilla luciferase and the cDN A of the "humanized" 
Aequorea green fluorescent protein. According to another embodiment of the present 
invention, there is provided a single polypeptide that exhibits both Renilla luciferase and GFP 
activities. This Afunctional polypeptide can facilitate the identification of transformed cells at 
the single cell level, in cell cultures, transformed tissues and organs based on fluorescence of 
the polypeptide. At the same time, the polypeptide can also be used to quantify promoter 
activations and GFP fluorescence based on luciferase activity measurements. 

The cDNA of Renilla reniformis luciferase (rue) has been cloned and used 
successfully as a marker gene in a variety of transgenic species. See, for example, Lorenz, 
W.W. McCann, R.O., Longiaru, M. and Cormier, MJ. Isolation and expression of a cDNA 
encoding Renilla reniformis luciferase. Proc. Natl. Acad. Sci. USA 1991; 88:4438-4442; 
Mayerhofer, R., Langridge, W.H.R., Cormier, M.J., and Szalay, A. A. Expression of 
recombinant Renilla luciferase in transgenic plants results in high levels of light emission. The 
Plant Journal 1995; 7:1031-1038; and Lorenz, W.W., Cormier, M.J., O'Kane, D.J., Hua, 
D., Escher, A. A. Szalay, A. A. Expression of the Renilla reniformis luciferase gene in 
mammalian cells. J. Biolumin. Chemilumin. 1995; 11:31-37, incorporated herein by 
reference in their entirety. Similarly, the transfer and expression of Green-Fluorescent-Protein 
(GFP) cDNA from Aequorea victoria resulted in high levels of GFP in transformed cells that 
allowed convenient visualization of individual cells under the microscope. See, for example, 
Chalfie, M., Tu, Y., Euskirchen, G., Ward, W.W. and Prasher, D.C. Green fluorescent 
protein as a marker for gene expression. Science 1994; 263:802-805, incorporated herein by 
reference in its entirety. 

The present invention involves the production of fusion genes from the cDNA 
of Renilla (rue) and the cDNA of the "humanized" Aequorea GFP (gfp h ). A description of 
"humanized" Aequorea GFP (gfp h ) can be found, for example, in Zolotukhin, S., Potter, M., 
and Huaswirth, W.W., Guy, J., and Muzyczka, N. A "humanized" green fluorescent protein 
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cDNA adapted for high-level expression in mammalian cells. J. Virology 1996: 70:4646- 
4654, incorporated herein by reference in its entirety. 

The first fusion gene, designated the "RG fusion gene," SEQ ID NO:l and 
shown at the top of Figures 1 and 2, contains the Renilla cDNA linked at the modified 3' end 
5 to a fifteen polynucleotide linker sequence encoding five amino acids, Ala-Ala-Ala-Ala-Thr, 
residues 312-316 of SEQ ID NO:l, followed by the 5' end of the intact GFP cDNA. The 
second fusion gene, designated the "GR fusion gene," SEQ ID NO:2 and shown at the bottom 
of Figures 1 and 2, contains the cDNA of GFP linked to a twenty-seven polynucleotide linker 
sequence encoding nine amino acids, Gly-Try-Gln-Ile-Glu-Phe-Ser-Leu-Lys, residues 239-247 
10 of SEQ ID NO:2, followed by the 5' end of Renilla cDNA. Both genes were placed into 

prokaryotic pGEM-5zf(+) and eukaryotic pCEP4 expression vectors, and transformed into E. 
coli, and various mammalian cell lines, and microinjected into mouse embryos. PT 7 was the 
bacterial T7 promoter used for gene expression. P cmv was the CMV promoter used for gene 
expression in mouse fibroblast cells, embryonic stem cells and mouse embryos. 

15 Unexpectedly, only cells transformed with the RG fusion gene gave strong 

fluorescence while the cells containing the GR fusion gene exhibited minimal response to UV 
light under the microscope. In contrast, luciferase measurements in homogenates of cells 
transformed with RG gene cassettes or with GR gene cassettes were indistinguishable from 
each other in both bacterial and mammalian cells. Further, spectrofluorimeter data indicated 

20 that there was energy transfer between Renilla luciferase and GFP in the RG fusion gene 
containing cells but did not indicate such energy transfer in cells containing the GR fusion 
gene. The protein expressed in the RG fusion gene containing cells was analyzed and found to 
be a 65 kDa polypeptide. A detailed description of the construction and expression of the 
fusion genes, and analyses of their protein products is given below. 

25 Productio n of the Fusion Gene Constructs- 

The vectors used for cloning and expression of the gene constructs in E. coli 
and mammalian systems were pGEM-5zf(+) (Promega) and pCEP4, respectively. Figure 3 is 
a map of the plasmids used for cloning and expression of the RG gene construct in E. coli, 
pGEM-5zf(+)-RG (top) and the map of the plasmids used for cloning and expression of the 

30 GR gene construct in E. coli, pGEM-5zf(+)-GR (bottom). Both were under the 

transcriptional control of T7 promoter. The E. coli strains which were transformed were 
DLT101 and DH5a. 
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Similarly, Figure 4 is a map of the plasmids used for cloning and expression of 
the RG gene construct in mammalian systems, pCEP4-RG (top), and a map of the plasmids 
used for cloning and expression of the GR gene construct in mammalian systems, pCEP4-GR 
(bottom). Both were under the transcriptional control of CMV promoter. The mammalian 
5 cell line that was transformed was LM-TK" embryonic stem cells and embryos. 

Five primers were designed for cloning the RG and GR gene constructs. Single 
underlines indicate Shine-Dalgarno sequences. Double underlines indicate the restriction sites. 
The stan codons are in bold. Sequences in bold italics indicate the removal of stop codons 
from both rue and gfp h genes. 
10 Primer 1, SEQ ID NO:3: RUC5: 5' CTGCAG (PstI) 

AGGAGG AATTC AGCTTAAAGATG3 ' 
Primer 2. SEQ ID NO:4: RUC3: 5 ' GCGGCCGC (Not I) TTG TTC A.TTTTTGAGAAC3 ' 
Primer 3, SEQ ID NO: 5: GFP5 : 5 ' GG GGT ACC (Kpnl) 

CCATG AGCAAGGGCGAGGAACT3 ' 
15 Primer 4, SEQ ID NO: 6: GFP3: 5' GG GGT ACC (Kpnl) 

CCr7UTACAGCTCGTCCATGCCA3 ' 
Primer 5, SEQ ID NO:7: GFP5a 5' CCCGGG (Smal) 

AGQAGGTACCCCATGAGCAAG3' . 
The Renilla luciferase-GFP fusion gene (RG gene cassette) and the GFP- 
20 Renilla luciferase fusion gene (GR gene cassette) were constructed by removing the stop 
codons, and by adding restriction sites and Shine-Dalgarno sequences to the 5' end of the 
cDNAs using PCR according to techniques known to those with skill in the art. The PCR 
products were cloned using the pGEM-T system (Promega Corporation, Madison, WI). 
Primers were designed so that the downstream cDNA is in frame with the upstream cDNA. 
25 The linker sequences are shown in Figures 1 and 2 and described above. After cloning, the 
RG and GR gene cassettes were under the transcriptional control of T7 in pGEM-5zf( + ) 
vector and CMV in pCEP4 vector, which were used for expression in E. coli and mammalian 
cells, respectively. 

Determinatio n of activity of fusion genes and their corresponding protein products : 
30 GFP activity in vivo was visualized as follows. E. coli strain DH5a was 

transformed with the plasmids pGEM-5zf( + )-RG and pGEM-5zf(-f)-GR. Positive colonies 
were identified and cultured in LB medium with 100 ^ig/ml of ampicillin selection, according 
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to techniques known to those with skill in the art. Twelve hours later, one drop of £. coli 
culture was put on a slide and visualized by fluorescent microscopy at 1000 x magnification. 
LM-TK* cells were transfected with plasmids pCEP4-RG and pCEP4-GR using calcium 
phosphate methods known to those with skill in the art. The culture dishes were monitored 
5 using an inverted fluorescent microscope 12 hours after the transfection. 

Luciferase activity was assayed as follows. An aliquot of transformed E. coli 
was used for a luciferase assay in a Turner TD 20e luminometer (Turner Designs, Sunnyvale, 
CA), both before and after IPTG induction. The results were recorded as relative light units. 
Mammalian cells harvested 36 hrs after transfection were measured for luciferase activity. 

10 Corrected emission spectra were detected spectrofluorimetrically using a SPEX 

fluorolog spectrofluorimeter operated in the ratio mode. Fluorescence emission was excited at 
390 nm. Bioluminescence emission was recorded with the excitation beam blocked following 
the addition of 0. 1 tig of coelenterazine in acidified methanol. Five spectra were averaged for 
each sample over a wavelength range from 400 to 600 nm. 

15 The fusion proteins were isolated and immunoactivity detected as follows. 1 ml 

of E. coli (00600=1.0) was harvested. 400 /xl of cell suspension buffer (0.1M NaCl, 0.01 M 
Tris-HCl pH 7.6, 0.001 M EDTA, 100 /xg/ml PMSF) and 100 fx\ of loading buffer (50 mM 
Tris-HCl pH 6.8, 2% SDS, 10% glycerol, 5% 2-mercaptoethanoI) were added. The samples 
were boiled for 4 min and loaded to a 7.5%-20% gradient SDS-polyacrylamide gel. 

20 Polyclonal anti-Renilla luciferase was used as the primary antibody for detection and goat 
peroxidase anti-IgG (anti-rabbit) as the secondary antibody. 

Referring now to Figure 5, there are shown photomicrographs of GFP activity 
in transformed E. coli cells (5A, left side) and LM-TK" mouse fibroblast cells (5B, right side) 
by fluorescence microscopy and fluorescence imaging. As can be seen, individual E. coli cells 

25 and mammalian cells transformed with the RG fusion gene construct exhibited strong green 
fluorescence under oil immersion. 

Referring now to Figure 6, there are shown bar graphs of luciferase activity of 
the gene constructs in £. coli (top) and mammalian cells (bottom). The white bars indicate 
activity before promoter induction. The black bars indicate activity after promoter induction. 

30 As can be seen, cells transformed with the RG fusion gene construct have significant luciferase 
activity, which is reduced 3-fold in the cells transformed with the GR fusion gene construct. 
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Referring now to Figure 7, there is shown a spectroscopic measurement of 
Renilla luciferase activity and GFP activity in E. coli transformed with various gene 
constructs. As can be seen, cells containing Renilla luciferase gene (short dashes) show only 
one emission peak at approximately 478 nm. Cells containing the GR gene fusion construct 

5 (light solid) also show one emission peak at approximately 478 nm, indicating Renilla 

luciferase activity only. By contrast, cells containing the RG gene fusion construct (heavy 
solid) show an emission peak at approximately 510 nm with excitation at 390 nm. Cells 
containing the RG gene fusion construct with the addition of coelanterizine (long dashes) show 
emission peaks at both approximately 478 nm and 510 nm indicating that the energy transfer 

10 between Renilla luciferase and GFP occurred in these cells. The lack of GFP activity in GR 
gene cassette transformed cell lines could be due to incorrect folding, due to the requirement 
for a free GFP C -terminus, or due to interference of the linker polypeptide with GFP activity 
in the fusion protein, among other possible explanations. 

Referring now to Figure 8, there is shown a western blot used to detect fusion 

15 gene expression in £. coli using ant\-Renilla luciferase antibody. Reading from left to right, 
the "C" lane shows the total protein extracted from non-transformed E. coli cells. The "R M 
lane shows the total protein extracted from E. coli cells transformed with the rue gene alone. 
The "G" lane shows the total protein extracted from E. coli cells transformed with the gfp h 
gene alone. The "RG" lane shows the "total protein extracted from E. coli cells transformed 

20 with the RG fusion gene cassette. The "GR" lane shows the total protein extracted from E. 
coli cells transformed with the GR fusion gene cassette. 

As can be seen, protein extracted from E. coli cells transformed with the rue 
gene alone produced a band with a molecular weight of about 34 kDa. Protein extracted from 
E. coli cells transformed with the RG fusion gene cassette produced a band with a molecular 

25 weight of about 65 kDa. Protein extracted from E, coli cells transformed with the GR fusion 
gene cassette produced a band with a molecular weight of about 34 kDa. These data imply that 
cells transformed with the GR fusion gene cassette produced luciferase but did not produce 
fusion protein. Such a lack of fusion protein production by cells transformed with the GR 
fusion cassette would explain the lack of green fluorescent activity in these cells. 

30 Referring now to Figure 9, there are shown photomicrographs using 

fluorescence image analysis demonstrating the expression of the RG fusion gene in mouse 
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embryonic stem cells transformed by electroporation procedures. Transformed colonies were 
selected based on GFP activity under fluorescence microscopy. 

Referring now to Figure 10, there are shown photomicrographs using 
fluorescence image analysis demonstrating the expression of the RG fusion genes in mouse 
5 embryos. The embryos were injected with the linearized RG plasmid, and in vitro cultured. 
The expression of GFP activity was monitored daily by fluorescent microscope and recorded 
by an imaging collection system. 

Based on this data, we conclude that the RG fusion construct disclosed herein 
can be expressed in both prokaryotic and eukaryotic cells to produce a Afunctional polypeptide 
10 that exhibits both Renilla luciferase and GFP activity. This Afunctional polypeptide is a useful 
tool for identification of transformed cells at the single cell level based on fluorescence. It 
allows the simultaneous quantification of promoter activation in transformed tissues and 
transgenic organisms by measuring luciferase activity. The dual function of this protein allows 
the monitoring of bacterial cells in their living hosts and the differentiation of cells in the 
15 developing embryo and throughout the entire animal. 

Although the present invention has been discussed in considerable detail with 
reference to certain preferred embodiments, other embodiments are possible. Therefore, the 
spirit and scope of the appended claims should not be limited to the description of preferred 
embodiments contained herein. 
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SEQUENCE LISTING 



1.44 Mb storage 



(1) GENERAL INFORMATION: 

(i) APPLICANT : Szalay, Aladar A. 

Wang, Gefu 
Wang, Yubao 

(ii) TITLE OF INVENTION: THE CONSTRUCTION AND EXPRESSION OF 
RENILLA LUCI FERASE AND GREEN FLUORESCENT PROTEIN FUSION GENES 

(ill) NUMBER OF SEQUENCES: 7 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sheldon & Mak 

(B) STREET: 225 S. Lake Avenue, 9th Floor 

(C) CITY: Pasadena 

(D) STATE: California 

(E) ZIP: 91101 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 

(B) COMPUTER: IBM compatible 

(C) OPERATING SYSTEM: Windows 95 

(D) SOFTWARE: WordPerfect for Windows version 6.1 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: to be assigned 

(B) FILING DATE: September 24, 1997 

(C) CLASSIFICATION: to be assigned 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Farah, David A. 

(B) REGISTRATION NUMBER: 38,134 

(C) REFERENCE/DOCKET NUMBER: 11785-1PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 626/796-4000 

(B) TELEFAX: 626/795-6321 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



ATG 
Met 
1 



ACT TCG 
Thr Ser 



AAA 
Lys 



GTT 
Val 
5 



TAT 
Tyr 



GAT 
Asd 



CCA 
Pro 



GAA CAA AGG 
Glu Gin Arg 
10 



AAA CGG 
Lys Arg 



ATG ATA ACT 
Met He Thr 
15 



48 



GGT 
Gly 



CCG CAG 
Pro Gin 



TGG 
Trp 
20 



TGG 
Trp 



GCC 
Ala 



AGA 
Arg 



TGT 
Cys 



AAA CAA ATG 
Lys Gin Met 
25 



AAT GTT 
Asn Val 



CTT GAT TCA 
Leu Asp Ser 
30 



96 



TTT 
Phe 



ATT AAT 
lie Asn 
35 



TAT 
Tyr 



TAT 
Tyr 



GAT 
Asp 



TCA 
Ser 



GAA 
Glu 
40 



AAA CAT GCA 
Lys His Ala 



GAA AAT 
Glu Asn 
45 



GCT GTT ATT 
Ala Val He 



144 



TTT 
Phe 



TTA CAT 
Leu His 
50 



GGT 
Gly 



AAC 
Asn 



GCG 
Ala 



GCC 
Ala 
55 



TCT 
Ser 



TCT TAT TTA 
Ser Tyr Leu 



TGG CGA 
Trp Arg 
60 



CAT GTT GTG 
His Val Val 



192 



CCA CAT ATT GAG CCA GTA GCG CGG TGT ATT ATA CCA GAT CTT ATT GGT 
Pro His lie Glu Pro Val Ala Arg Cys He He Pro Asp Leu He Gly 
65 70 75 80 



240 



ATG GGC AAA TCA GGC AAA TCT GGT AAT GGT TCT TAT AGG 
Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg 
85 90 



TTA CTT GAT 
Leu Leu Asp 
95 



288 
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CAT TAC AAA TAT CTT ACT GCA TGG TTT GAA CTT CTT AAT TTA CCA AAG 336 

His Tyr Lys Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys 

100 105 110 

AAG ATC AAT TTT GTC GGC CAT GAT TGG GGT GCT TGT TTG GCA TTT CAT 38 4 

Lys lie lie Phe Val Gly His Asp Trp Gly Ala Cys Leu Ala Phe His 
115 120 125 

TAT AGC TAT GAG CAT CAA GAT AAG ATC AAA GCA ATA GTT CAC GCT GAA 4 32 

Tyr Ser Tyr Glu His Gin Asp Lys lie Lys Ala lie Val His Ala Glu 
130 135 140 

AGT GTA GTA GAT GTG ATT GAA TCA TGG GAT GAA TGG CCT GAT ATT GAA 480 

Ser Val Val Asp Val lie Glu Ser Trp Asp Glu Trp Pro Asp lie Glu 
145 150 155 160 

GAA GAT ATT GCG TTG ATC AAA TCT GAA GAA GGA GAA AAA ATG GTT TTG 52 8 

Glu Asp lie Ala Leu lie Lys Ser Glu Glu Gly Glu Lys Met Val Leu 
165 170 175 

GAG AAT AAC TTC TTC GTG GAA ACC ATG TTG CCA TCA AAA ATC ATG AGA 57 6 

Glu Asn Asn Phe Phe Val Glu Thr Met Leu Pro Ser Lys lie Met Arg 

180 185 190 

AAG TTA GAA CCA GAA GAA TTT GCA GCA TAT CTT GAA CCA TTC AAA GAG 62 4 

Lys Leu Glu Pro Glu Glu. Phe Ala Ala Tyr Leu Glu Pro Phe Lys Glu 
195 200 205 

AAA GGT GAA GTT CGT CGT CCA ACA TTA TCA TGG CCT CGT GAA ATC CCG 67 2 

Lys Gly Glu Val Arg Arg Pro Thr Leu Ser Trp Pro Arg Glu lie Pro 
210 215 220 

TTA GTA AAA GGT GGT AAA CCT GAC GTT GTA CAA ATT GTT AGG AAT TAT 72 0 

Leu Val Lys Gly Gly Lys Pro Asp Val Val Gin lie Val Arg Asn Tyr 
225 230 235 240 

AAT GCT TAT CTA CGT GCA AGT GAT GAT TTA CCA AAA ATG TTT ATT GAA 7 68 

Asn Ala Tyr Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe lie Glu 
245 250 255 

TCG GAT CCA GGA TTC TTT TCC AAT GCT ATT GTT GAA GGC GCC AAG AAG 816 
Ser Asp Pro Gly Phe Phe Ser Asn Ala lie Val Glu Gly' Ala Lys Lys 

260 265 270 

TTT CCT AAT ACT GAA TTT GTC AAA GTA AAA GGT CTT CAT TTT TCG CAA 8 64 

Phe Pro Asn Thr Glu Phe Val Lys Val Lys Gly Leu His Phe Ser Gin 
275 280 285 

GAA GAT GCA CCT GAT GAA ATG GGA AAA TAT ATC AAA TCG TTC GTT GAG 912 

Glu Asp Ala Pro Asp Glu Met Gly Lys Tyr lie Lys Ser Phe Val Glu 
290 295 300 

CGA GTT CTC AAA AAT GAA CAA GCG GCC GCC GCC ACC ATG AGC AAG GGC 9 60 

Arg Val Leu Lys Asn Glu Gin Ala Ala Ala Ala Thr Met Ser Lys Gly 
305 310 315 320 

GAG GAA CTG TTC ACT GGC GTG GTC CCA ATT CTC GTG GAA CTG GAT GGC 1008 

Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu Asd Gly 
325 330 335 

GAT GTG AAT GGG CAC AAA TTT TCT GTC AGC GGA GAG GGT GAA GGT GAT 1056 

Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp 

340 345 350 
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GCC AC A TAC GGA AAG CTC ACC CTG AAA TTC ATC TGC ACC ACT GGA AAG 1104 
Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr Gly Lys 
355 360 365 

CTC CCT GTG CCA TGG CCA ACA CTG GTC ACT ACC TTC ACC TAT GGC GTG 1152 
Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Thr Tyr Gly Val 
370 375 380 

CAG TGC TTT TCC AGA TAC CCA GAC CAT ATG AAG CAG CAT GAC TTT TTC 12 00 

Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin His Asp Phe Phe 
385 390 395 400 

AAG AGC GCC ATG CCC GAG GGC TAT GTG CAG GAG AGA ACC ATC TTT TTC 12 48 

Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie Phe Phe 
405 410 415 

AAA GAT GAC GGG AAC TAC AAG ACC CGC GCT GAA GTC AAG TTC GAA GGT 12 9 6 

Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly 
420 425 430 

GAC ACC CTG GTG AAT AGA ATC GAG CTG AAG GGC ATT GAC TTT AAG GAG 134 4 

Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu 
435 440 445 

GAT GGA AAC ATT CTC GGC CAC AAG CTG GAA TAC AAC TAT AAC TCC CAC 13 92 

Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His 
450 455 460 

AAT GTG TAC ATC ATG GCC GAC AAG CAA AAG AAT GGC ATC AAG GTC AAC 1440 
Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly lie Lys Val Asn 
465 470 475 480 

TTC AAG ATC AGA CAC AAC ATT GAG GAT GGA TCC GTG CAG CTG GCC GAC 148 8 

Phe Lys lie Arg His Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp 
485 490 495 

CAT TAT CAA CAG AAC ACT CCA ATC GGC GAC GGC CCT GTG CTC CTC CCA 153 6 

His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro 
500 505 510 

GAC AAC CAT TAC CTG TCC ACC CAG TCT GCC CTG TCT AAA GAT CCC ACC 15 8 4 

As d Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn 
515 520 525 

GAA AAG AGA GAC CAC ATG GTC CTG CTG GAG TTT GTG ACC GCT GCT GGG 163 2 

Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly 
530 535 540 

ATC ACA CAT GGC ATG GAC GAG CTG TAC AAG TGA 166 5 

lie Thr His Gly Met Asp Glu Leu Tyr Lys 
545 550 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1677 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

ATG AGC AAG GGC GAG GAA CTG TTC ACT GGC GTG GTC CCA ATT CTC GTG 4 8 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
1 5 10 15 
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GAA CTG GAT GGC GAT GTG AAT GGG CAC AAA TTT TCT GTC AGC GGA GAG 9 6 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Glv Glu 
20 25 30 

GGT GAA GGT GAT GCC ACA TAC GGA AAG CTC ACC CTG AAA TTC ATC TGC 144 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 

ACC ACT GGA AAG CTC CCT GTG CCA TGG CCA ACA CTG GTC ACT ACC TTC 192 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

ACC TAT GGC GTG CAG TGC TTT TCC AGA TAC CCA GAC CAT ATG AAG CAG 2 40 

Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

CAT GAC TTT TTC AAG AGC GCC ATG CCC GAG GGC TAT GTG CAG GAG AGA 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Ara 
85 90 95 

ACC ATC TTT TTC AAA GAT GAC GGG AAC TAC AAG ACC CGC GCT GAA GTC 336 
Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 HO 

AAG TTC GAA GGT GAC ACC CTG GTG AAT AGA ATC GAG CTG AAG GGC ATT 384 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Glv He 
115 120 125 

GAC TTT AAG GAG GAT GGA AAC ATT CTC GGC CAC AAG CTG GAA TAC AAC 4 32 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 



288 



480 



TAT AAC TCC CAC AAT GTG TAC ATC ATG GCC GAC AAG CAA AAG AAT GGC 
Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

ATC AAG GTC AAC TTC AAG ATC AGA CAC AAC ATT GAG GAT GGA TCC GTG 52 8 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 

CAG CTG GCC GAC CAT TAT CAA CAG AAC ACT CCA ATC GGC GAC GGC CCT 57 6 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Glv Pro 
180 185 190 

GTG CTC CTC CCA GAC AAC CAT TAC CTG TCC ACC CAG TCT GCC CTG TCT 62 4 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTG CTG GAG TTT GTG 672 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

ACC GCT GCT GGG ATC ACA CAT GGC ATG GAC GAG CTG TAC AAG GGG TAC 72 0 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys Gly Tvr 
225 230 235 240 

CAG ATC GAA TTC AGC TTA AAG ATG ACT TCG AAA GTT TAT GAT CCA GAA 7 68 

Gin He Glu Phe Ser Leu Lys Met Thr Ser Lys Val Tyr Asp Pro Glu 
245 250 255 

CAA AGG AAA CGG ATG ATA ACT GGT CCG CAG TGG TGG GCC AGA TGT AAA 816 
Gin Arg Lys Arg Met. He Thr Gly Pro Gin Trp Trp Ala Arg cys Lvs 
260 265 270 

CAA ATG AAT GTT CTT GAT TCA TTT ATT AAT TAT TAT GAT TCA GAA AAA 8 64 

Gin Met Asn Val Leu Asp Ser Phe He Asn Tyr Tyr Asp Ser Glu Lys 
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275 280 285 

CAT GCA GAA AAT GCT GTT ATT TTT TTA CAT GGT AAC GCG GCC TCT TCT 912 
His Ala Glu Asn Ala Val lie Phe Leu His Gly Asn Ala Ala Ser Ser 
290 295 300 

TAT TTA TGG CGA CAT GTT GTG CCA CAT ATT GAG CCA GTA GCG CGG TGT 9 60 

Tyr Leu Trp Arg His Val Val Pro His He Glu Pro Val Ala Arg Cys 
305 310 315 320 

ATT ATA CCA GAT CTT ATT GGT ATG GGC AAA TCA GGC AAA TCT GGT AAT 1008 
He He Pro Asp Leu lie Gly Met Gly Lys Ser Gly Lys Ser Gly Asn 
325 330 335 

GGT TCT TAT AGG TTA CTT GAT CAT TAC AAA TAT CTT ACT GCA TGG TTT 1056 
Gly Ser Tyr Arg Leu Leu Asp His Tyr Lys Tyr Leu Thr Ala Trp Phe 
340 345 350 

GAA CTT CTT AAT TTA CCA AAG AAG ATC ATT TTT GTC GGC CAT GAT TGG 1104 
Glu Leu Leu Asn Leu Pro Lys Lys lie lie .Phe Val Gly His Asp Trp 
355 360 365 

GGT GCT TGT TTG GCA TTT CAT TAT AGC TAT GAG CAT CAA GAT AAG ATC 1152 
Gly Ala Cys Leu Ala Phe His Tyr Ser Tyr Glu His Gin Asp Lys He 
370 375 380 

AAA GCA ATA GTT CAC GCT GAA AGT GTA GTA GAT GTG ATT GAA TCA TGG 12 00 

Lys Ala He Val His Ala Glu Ser Val Val Asp Val lie Glu Ser Trp 
385 390 395 400 

GAT GAA TGG CCT GAT ATT GAA GAA GAT ATT GCG TTG ATC AAA TCT GAA 12 4 8 

Asp Glu Trp Pro Asp He Glu Glu Asp He Ala Leu lie Lys Ser Glu 
405 410 415 

GAA GGA GAA AAA ATG GTT TTG GAG AAT AAC TTC TTC GTG GAA ACC ATG 12 96 

Glu Gly Glu Lys Met Val Leu Glu Asn Asn Phe Phe Val Glu Thr Met 
420 425 430 

TTG CCA TCA AAA ATC ATG AGA AAG TTA GAA CCA GAA GAA TTT GCA GCA 134 4 

Leu Pro Ser Lys He Met Arg Lys Leu Glu Pro Glu Glu Phe Ala Ala 
435 440 445 

TAT CTT GAA CCA TTC AAA GAG AAA GGT GAA GTT CGT CGT CCA ACA TTA 13 92 

Tyr Leu Glu Pro Phe Lys Glu Lys Gly Glu Val Arg Arg Pro Thr Leu 
450 455 460 

TCA TGG CCT CGT GAA ATC CCG TTA GTA AAA GGT GGT AAA CCT GAC GTT 14 40 

Ser Trp Pro Arg Glu lie Pro Leu Val Lys Gly Gly Lys Pro Asp Val 
465 470 475 480 

GTA CAA ATT GTT AGG AAT TAT AAT GCT TAT CTA CGT GCA AGT GAT GAT 14 88 

Val Gin He Val Arg Asn Tyr Asn Ala Tyr Leu Arg Ala Ser Asp Asp 
485 490 495 

TTA CCA AAA ATG TTT ATT GAA TCG GAT CCA GGA TTG TTT TCC AAT GCT 153 6 

Leu Pro Lys Met Phe He Glu Ser Asp Pro Gly Phe Phe Ser Asn Ala 
500 505 510 

ATT GTT GAA GGC GCC AAG AAG TTT CCT AAT ACT GAA TTT GTC AAA GTA 158 4 

He Val Glu Gly Ala Lys Lys Phe Pro Asn Thr Glu Phe Val Lys Val 
515 520 525 

AAA GGT CTT CAT TTT TCG CAA GAA GAT GCA CCT GAT GAA ATG GGA AAA 1632 
Lys Gly Leu His Phe Ser Gin Glu Asp Ala Pro Asp Glu Met Gly Lys 
530 535 ' 540 
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TAT 




Tyr 
545 



1677 



(3) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CTGCAGAGGA GGAATTCAGC TTAAAGATG 

(4) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRI PTION : SEQ ID NO : 4 : 

GCGGCCGCTT GTTCATTTTT GAGAAC 

(5) INFORMATION FOR SEQ ID NO : 5 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : single 
f D) TOPOLOGY : linear 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 5 : 

GGGGTACCCC ATGAGCAAGG GCGAGGAACT 

(6) INFORMATION FOR SEQ ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRI PTION : SEQ ID NO : 6 : 

GGGGTACCCC TTGTACAGCT CGTCCATGCC A 

(7) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO : 7 - 



CCCGGGAGGA GGTACCCCAT GAGCAAG 
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WE CLAIM: 

1. A protein comprising a polypeptide having both luciferase and GFP activities or 
biologically active variants thereof. 

2. A recombinant protein according to claim 1. 

5 3. A protein according to claim 1, having an amino acid sequence as set forth in SEQ 

ID NO:l. 

4. A high affinity monoclonal antibody which immunoreacts with the polypeptide of 
claim 1. 

5. The antibody of claim 4 having an Fc portion selected from the group consisting of 
10 the IgM class, the IgG class and the IgA class. 

6. A protein recognized by a monoclonal antibody having affinity to the polypeptide of 
claim 1. 

7. The protein of claim 1 in purified and isolated form. 

8. A DNA sequence coding for a protein according to claim 1, or its complementary 
15 strands. 

9. A DNA sequence which hybridizes to a DNA sequence according to claim 8 and 
which codes on expression for a polypeptide having both luciferase and GFP activities, or its 
complementary strands. 

10. A high affinity monoclonal antibody which immunoreacts with a polypeptide 
20 having both luciferase and GFP activities. 

11. A purified and isolated DNA molecule comprising a polynucleotide coding for a 
polypeptide having both luciferase and GFP activities, or its complementary strands. 

12. The DNA of claim 11, wherein the polynucleotide comprises the sequence as set 
forth in SEQ ID NO:l. 

25 13. A vector containing a DNA molecule coding for a polypeptide having both 

luciferase and GFP activities. 

14. The vector of claim 13, wherein the polynucleotide comprises the sequence as set 
forth in SEQ IDNO:l. 

15. A prokaryotic or eukaryotic host cell stably transformed or transfected by the 
30 vector of claim 13. 

16. A method of making a polypeptide having both luciferase and GFP activities, the 
method comprising the steps of: 
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(a) culturing a microorganism transformed with a polynucleotide coding for a 
polypeptide having both luciferase and GFP activities; and 

(b) recovering the polypeptide having both luciferase and GFP activities. 

17. A method of quantifying promoter activations and GFP fluorescence based on 
luciferase activity measurements, the method comprising the step of providing the polypeptide 
according to claim 1 . 

18. A method of making a monoclonal antibody which immunoreacts with a 
polypeptide having both luciferase and GFP activities, the method comprising the steps of: 

(a) administering to a host a polypeptide having both luciferase and GFP 
activities in an amount sufficient to induce the production of antibodies to the 
polypeptide; 

(b) recovering the antibody-producing cells from the host; 

(c) forming cell hybrids by fusing the antibody-producing cell to cells capable 
of substantially unlimited reproduction; 

(d) culturing the hybrids; and 

(e) collecting the monoclonal antibodies as a product of the hybrids. 

19. A method of monitoring gene expression quantitatively and qualitatively in a cell 
using a gene fusion construct coding for a polypeptide having both luciferase and GFP 
activities, the method comprising the steps of: 

(a) providing a gene fusion construct coding for a polypeptide having both 
Renilla luciferase and GFP activity; 

(b) introducing the gene fusion construct into the cell; 

(c) maintaining the cell containing the gene fusion construct in a manner 
allowing the cell to express the polypeptide; and 

(d) measuring the cell for luciferase and fluorescent activity. 

20. The method of claim 19, where the step of providing comprises providing a 
construct including a polynucleotide sequence as set forth in SEQ ID NO:l. 

21. A method of monitoring gene expression quantitatively and qualitatively in a cell 
using a gene fusion construct coding for a polypeptide having both luciferase and GFP 
activities, the method comprising the steps of: 

(a) providing a gene fusion construct comprising the protein of claim 1; 

(b) introducing the gene fusion construct into the cell; 
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(c) maintaining the cell containing the gene fusion construct in a manner 
allowing the cell to express the polypeptide; and 

(d) measuring the cell for luciferase and fluorescent activity. 



19 



WO 98/14605 



1/10 



FCT/US97/17162 



FIG. / 



Fusion Gene Cassettes for E. coli 
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FIG. 2 
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A METHOD TO ENABLE ASSESSMENT OF GROWTH AND DEATH OF 
MICRO-ORGANISMS 

This invention relates to a method to enable the assessment of growth and death of a 
micro-organism within a chosen time period in an environment of interest. 



5 BACKGROUND OF THE INVENTION 



When studying growth and death of a micro-organism under the influence of 
specific environments, e.g. production and storage environments that e.g. could or 
could not be refrigerated, or involving chemicals or matrixes, e.g. antibiotics, 
microbial toxins, heavy metals and serum complement, microbial cultures are most 
10 often incubated for hours or days. In these circumstances death and growth occur 
simultaneously. If additionally some of the cells lyse, e.g. when analysing the serum 
complement, it is difficult to know to what one should compare the amount of living 
cells at the end of the experiment. Convenient methods to determine the number of 
living cells, e.g. by measuring lucif erase bioluminescence, are known but if no more 
15 information is available it is impossible to assess to what extent growth or/and death 
of the micro-organisms takes or has taken place. 

Growth rates and death rates of micro-organisms in specific environments are of 
interest in many areas. Death rates and growth rates of micro-organisms and 
especially harmful and/or pathogenic micro-organisms are of importance in risk 
assessments of products of the pharmaceutical industry and products for human 
consumption with regard to their production, storage and distribution to the 
consumers. Knowledge of death and growth rates of micro-organisms are of 
particular importance in specific applications such as in the development of 
antibiotics, disinfectants and bactericidal products or monitoring of sterilisation, 
disinfection and cleaning processes. 
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Reporter genes coding for luminescent or/and fluorescent products have been 
introduced to micro-organisms to enable the assessment of the quantity or survival 
of living micro-organisms (WO 96/23898, WO 98/14605, WO 98/30715, 
WO 98/36081, US 5,824,468). Even simultaneous use of luminescent and 
5 fluorescent markers has been used (Fratamico et al., Journal of Food Protection, Vol 
50 No 10, 1997, 1167-1173). Luminescent and fluorescent markers have, however, 
only been used as markers for survival of micro-organisms and the use of two 
different markers within one micro-organism enabling the differentiation between 
growth and death rates has not been reported. 

1 0 OBJECT AND SUMMARY OF THE INVENTION 

The object of the present invention is to provide a method to enable the assessment 
of the growth and death of a micro-organism within a chosen time period in an 
environment of interest by introducing into said micro-organism at least two 
reporter genes. The method is characterised in that 

15 a) said reporter genes code for luminescent and/or fluorescent products and 
within said time period and environment at least two said products of the 
following are produced: 

i) an essentially stable product produced in a, within the environment of 
interest, essentially known proportion to the total amount of cells of said 

20 micro-organism that are or have been alive within said chosen time 

period, 

ii) a product present in said environment of interest in an essentially known 
proportion to the amount of cells alive at any moment within said chosen 
time period and 
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iii) an essentially stable product produced in a, within the environment of 
interest, essentially known proportion to the total amount of cells of said 
micro-organism that have died within said chosen time period, 

and said products can be measured through their luminescence and/or 
5 fluorescence; 

b) said micro-organism is incubated within the environment of interest and said 
luminescence and/or fluorescence is detected after said chosen time period, 
and 

c) the growth and death rate of the said micro-organism is assessed based on at 
10 least two of the following: 

i) the known proportion of luminescence or fluorescence to the amount of 
cells alive after any said chosen time period, 

ii) the known proportion of luminescence or fluorescence to the total 
amount of cells that are or have been alive within any said chosen time 

15 period, and 

iii) the known proportion of luminescence or fluorescence to the total 
amount of cells that have died within any said chosen time period. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows plasmid pGFP+luc* including genes for both GFP and firefly 
20 lucif erase. 

Figure 2 shows fluorescence during growth phase of E. coli with plasmid 
pGFP+luc* at 30 °C as a function of the concentration of ethanol in the cell culture. 

Figure 3 shows luminescence during growth phase of E. coli with plasmid 
pGFP+luc* at 30 °C as a function of the concentration of ethanol in the cell culture. 
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Figure 4 shows the amount of living cells, i.e. colony forming units, according to 
plating during growth phase of E. coli with plasmid pGFP+luc* at 30 °C as a 
function of the concentration of ethanol in the cell culture. 

Figure 5 shows the percentage of living cells according to live/dead staining and 
5 flow cytometric analysis during growth phase of E. coli with plasmid pGFP+luc* at 
30 °C as a function of the concentration of ethanol in the cell culture. 

Figure 6 shows fluorescence before and after incubation with serum complement 
during growth phase of E. coli with plasmid pGFP+luc* at 30 °C as a function of 
the concentration of serum complement in the cell culture. 

10 Figure 7 shows luminescence before and after incubation with serum complement 
during growth phase of E. coli with plasmid pGFP+luc* at 30 °C as a function of 
the concentration of serum complement in the cell culture. 

Figure 8 shows the percentage of living cells according to plating during growth 
phase of E. coli with plasmid pGFP+luc* at 30 °C as a function of the concentration 
15 of serum complement in the cell culture. 

DETAILED DESCRIPTION OF THE INVENTION 

The method according to the present invention can be used to assess the growth and 
death rate of a micro-organism within a chosen time period in any particular 
environment of interest. The method is applicable if two different marker genes can 

20 be introduced to the micro-organism that code for luminescent and/or fluorescent 
products, and the products of these fulfil at least two of the following three criteria: 
a) a said luminescent product luminesces or said fluorescent product fluoresces in 
an essentially known proportion to the amount of cells of said micro-organism alive 
within said chosen time period ; 

25 b) a said luminescent product luminesces or said fluorescent product fluoresces in 
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an essentially known proportion to the amount of cells of said micro-organism that 
are or have been alive within said chosen time period, and 

c) a said luminescent product luminesces or said fluorescent product fluoresces in 
an essentially known proportion to the amount of cells of said micro-organism that 
5 have died within said chosen time period. 

In the present application the concept "micro-organism" means any micro-organism 
into which marker genes can be introduced so, that they will function according to 
the invention. "Micro-organism" can therefore stand for bacteria, yeast or fungi. 

The concept of "'introducing a marker gene into a micro-organism" means any 
10 method by which a marker gene can be made to function within the micro-organism 
according to the invention. One way of introducing marker genes into micro- 
organism is by constructing a recombinant strain. This can be done by transforming 
a strain with a plasmid including the marker genes. An alternative way to introduce 
reporter genes to bacteria is to utilise transposable elements. In this technique, 
15 reporter genes are inserted between insertion sequences in a delivery plasmid. The 
plasmid is then introduced to a cell by e.g. conjugation of transformation, and once 
inside the cell, genes surrounded by the insertion sequences are integrated into 
bacterial chromosome. Integration is stable, i.e. there is no need for a selectable 
marker such as antibiotic resistance. 

Assessment of the growth rate and death rate of a micro-organism can be of interest 
in many specific environments. Within pharmaceutical research the effect of 
different drugs and candidates for drugs, e.g. antibiotics, on the survival of 
pathogenic, but also the beneficial micro-organisms of the gut, is of interest. Thus 
the ultimate interest is in the behaviour of these micro-organisms in a physiological 
environment affected by drugs. 

Another vast area where the possibility of assessing growth and death rate of 
specific micro-organisms is of interest is that of production, processing, storage and 
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distribution of all products for human consumption. In this area the behaviour of 
pathogenic or potentially harmful micro-organisms in the different environments of 
the life cycle of these products is of special interest and involves many different 
aspects such as the influence of temperature, humidity or light and the possible use 
5 of preservatives etc. 

Additionally growth and death rates of micro-organisms can be of interest for 
environmental evaluations e.g. when evaluating the effect of emissions into the 
environment. 

Luminescent or fluorescent products coded by reporter genes in different 
10 embodiments of this invention can vary as long as their proportion to either the total 
amount of cells alive, to cells that are or have been alive, or to cells that have died is 
essentially known. Growth and death rate can be assessed if two of the following: 
cells alive, cells that are or have been alive, or cells that have died can be 
determined. Thus luminescence and/or fluorescence measured can be e.g. of a 
15 product which is expressed e.g. constitutively or triggered by a specific phase 
(e.g. replication or death) of the lifecycle of each cell, is stable or labile or which 
luminescence or fluorescence is dependant on a factor that relates e.g. to a specific 
phase of the lifecycle of each cell. Depending on the individual characteristics of 
said product — how produced, stable or labile, possible dependence of its 
20 luminescence or fluorescence of said factors etc. — the measured luminescence or 
fluorescence can be in proportion to one of the three unknown of which two must be 
known to be able to assess the growth rate and death rate of said cells. 

According to one specific embodiment of the invention assessment of the growth 
and death rate of an Esherichia coli strain under the influence of different chemicals 
25 or matrixes was enabled by constructing a recombinant strain, which expresses both 
luciferase and GFP. Altogether the effect of a number of different chemicals and 
matrixes, such as CdCl 2 , ethanol, the antibiotics chloramphenicol, rifampicin, and 
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tetracyclic as well as serum complement on said recombinant £*. coli strain was 
tested and found applicable. 

The invention will be described in more detail by the following study in which the 
growth rate and death rate of a recombinant Esherichia coli strain, which expresses 
5 both luciferase and GFP, is assessed under the influence of ethanol or serum 
complement. 

Summary of the study 

Genes for luciferase and green fluorescent protein have recently raised interest as 
reporter genes. Luciferase is an enzyme that produces luminescence in the presence 
10 of substrate luciferin, molecular oxygen and ATP. Green fluorescent protein (GFP), 
produces green fluorescence when excited with light. Many mutated forms of GFP 
have been introduced: some have different excitation and emission wavelengths 
from the wild type and some mutants form more stable proteins at higher 
temperatures. 

15 We constructed a recombinant strain of E. coli, which expresses both luciferase and 
GFP. In our construction we used a mutant of GFP, which is more stable at 
temperatures over +30 °C and it matures quicker than the wild type. Luciferase was 
from North American firefly, Photinus pyralis. 

The E. coli strain MCI 061 was transformed with a plasmid including genes for both 
20 GFP and firefly luciferase. Figure 1 describes the plasmid in general. The sequence 
of the plasmid is disclosed in the sequence listing. Essential codings of the sequence 
are as follows: 



lac promoter 



95-199 



GFP 



289-1008 



25 



firefly luciferase 
(3-lactamase 



1044-2696 



3251-4111 
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In our construct, see Figure 1, the luciferase gene is situated next to the GFP gene 
and both genes are transcribed in the same direction. The transcription is started at 
the lac promoter in front of GFP. The lac promoter is constitutively active, because 
the MCI 061 cells lack its repressor. The plasmid also has a gene for ampicillin 
resistance (p-lactamase). 

The transformed E. coli strain was propagated under the influence of different 
concentrations of ethanol or serum complement. 

Methods 

Growth conditions 

One colony from a pure culture plate was inoculated in 5 ml of LB -medium with 
ampicillin (100 fig/ml) and grown at +37 °C in a shaker, 250 rpm, for about three 
hours. After that, the number of cells per millilitre was determined with flow 
cytometry by using fluorescent spherical latex particles as a reference. One million 
cells were then removed to an erlenmeyer with 50 ml of LB medium and ampicillin. 
The culture was grown over night in a shaker, 190 rpm, at room temperature to 
prevent the culture from growing into the stationary phase during the night. In the 
morning, the culture was transferred to and grown in a shaker, 330 rpm, until the 
stationary phase was reached or used after growth at +30 °C for about 1 h to study 
the influence of ethanol or serum complement as described below. 

Influence of chemicals on the propagation of E. coli 

The culture obtained as described above was used to study the influence of ethanol 
or serum complement as follows: 
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Ethanol 

Ethanol (Aa, Primalco Oy) was diluted into pure water to obtain concentrations of 
50, 45, 25, 10, 5, 1 and 0 % of ethanol when 500 \x\ of said dilution was added to 
500 \xl of said culture in an eppendorf tube. The mixture was vortexed and incubated 
5 for 5 minutes before measuring fluorescence and luminescence. Live cells were 
again counted by plating and also by live/dead staining. In the live/dead protocol 
used the stain cyto 9 stains all cells whereas propidium iodide stains only the dead 
cells. After staining, cells are passed through a flow cytometer, with which dead and 
live cells can be differentiated and their proportion determined. (Virta et al. (1998) 
10 Appl. Environ. Microbiol. 64: 515-519.) 

Serum complement 

The influence of serum complement on the said recombinant E. coli strain was 
studied using an incubation time of 90 min as described for a different recombinant 
E. coli strain used in Virta et al. (1998) Appl. Environ. Microbiol. 64: 515-519. 

15 Fluorescence and luminescence measurements 

The measurements were done with a combined fluoro- and luminometer, Fluoroscan 
Ascent FL, provided by Labsystems Ltd. (Helsinki, Finland). Cell growth was 
simultaneously followed with a flow cytometer. 

For the measurements, 100 \x\ of bacterial culture was pipetted into the microtiter 
20 plate wells. Fluorescence was measured using 485 nm for excitation and 510 nm for 
emission. Measuring time was 20 ms. After the fluorescence measurement 100 julI of 
luciferin in 0.1 M citric acid-sodium citrate buffer (pH 5.0) was dispensed into the 
wells and the plate was shaken for two minutes (shaking diameter 1 mm, 
1 020 rpm), after which luminescence was recorded with a measuring time of 
25 1000 ms. 
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Plating 

Samples for plating were diluted 10 2 to 10 7 fold with 150 mM NaCl and plated onto 
L agar plates (L broth containing 1.6 % agar). Colonies were counted after 
overnight incubation at 37 °C. 

5 Live/dead staining and Flow cytometric analysis 

Bacteria from 1 000 jil of cell culture were used for live/dead staining and flow 
cytometric analysis using a LIVE/DEAD BacLight bacterial viability kit (catalogue 
no. L-7005) for microscopy and quantitative analysis obtained from Molecular 
Probes Europe (Leiden, The Netherlands) and Fluoresbrite beads (diameter, 1.8 jum) 
10 obtained from Polysciences Inc. (Warrington, Pa.) as described in Virta et al. (1998) 
Appl. Environ. Microbiol. 64: 515-519. 

Results 

When the cultures were transferred to +30 °C, the cells grew logarithmically for 1 — 
4 hours depending on the initial cell concentration. Luminescence and fluorescence 
15 rose logarithmically and were essentially constant per cell. Thus cell number could 
be assessed based on luminescence or fluorescence. 

When ethanol was added in different concentrations to the growth medium (see 
Figures 4 and 5) death was, after a very short incubation period of 5 min, more or 
less unsignificant at ethanol concentrations below 5 % and became more significant 

20 with increasing ethanol concentration reaching very pronounced significance at 
ethanol concentrations above 10 %. Correspondingly fluorescence (Figure 2) was 
essentially constant whatever the ethanol concentration in spite of dramatically 
decreasing corresponding live cell count according to plate count (Figure 4) and 
percentage of live cells according to the live/dead staining (Figure 5) whereas 

25 luminescence (Figure 3) dropped dramatically essentially corresponding to the 
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dramatic drop in plate count (Figure 4) and the percentage of live cells (Figure 5) 
with increased ethanol concentration. 

The effect of serum complement on the growth and death of E, coli is shown in 
Figures 6 to 8. Fluorescence (Figure 6) and luminescence (Figure 7) are shown 
5 before (squares) and after (circles) incubation for 90 minutes with serum 
complement. Fluorescence (Figure 6) is slightly increased, during incubation 
regardless of the concentration of serum, whereas luminescence (Figure 7) 
decreases during incubation with increasing serum concentration. The decrease of 
luminescence during incubation with increasing concentrations of serum correlates 
10 clearly with the percentage of cells alive after incubation (Figure 8). 
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CLAIMS 

1. A method to enable the assessment of the growth rate and death rate of a 
micro-organism within a chosen time period in an environment of interest by 
5 introducing into said micro-organism at least two reporter genes, which method is 
characterised in that 

a) said reporter genes code for luminescent and/or fluorescent products and 
within said time period and environment at least two said products of the 
following are produced: 

10 i) an essentially stable product produced in a, within the environment of 

interest, essentially known proportion to the total amount of cells of said 
micro-organism that are or have been alive within said chosen time 
period, 

ii) a product present in said environment of interest in an essentially known 
15 proportion to the amount of cells alive at any moment within said chosen 

time period, and 

iii) an essentially stable product produced in a, within the environment of 
interest, essentially known proportion to the total amount of cells of said 
micro-organism that have died within said chosen time period, 

20 and said products can be measured through their luminescence and/or 

fluorescence; 

b) said micro-organism is incubated within the environment of interest and said 
luminescence and/or fluorescence is detected after said chosen time period, 
and 

25 c) the growth and death rate of the said micro-organism is assessed based on at 
least two of the following: 



WO 00/75367 



PCT/FI00/00507 



13 

i) the known proportion of luminescence or fluorescence to the amount of 
cells alive after any said chosen time period, 

ii) the known proportion of luminescence or fluorescence to the total 
amount of cells that are or have been alive within any said chosen time 

5 period, and 

iii) the known proportion of luminescence or fluorescence to the total 
amount of cells that have died within any said chosen time period. 

2. The method according to claim 1 characterised in that said micro-organism is 
a gram negative bacteria, e.g. Escherichia coli. 

10 3. The method according to claim 1 or 2 characterised in that 

a) one reporter gene coding for a luminescent product is luciferase, which is used 
for the determination of amount of cells alive at any moment within said chosen 
time period, and 

b) another reporter gene coding for a fluorescent product is green fluorescent 
15 protein (GFP), which is used for the determination of total amount of cells of said 

micro-organism that are or have been alive within said chosen time period. 

4. The method according to claim 1 or 2 characterised in that said reporter genes 
are introduced into said micro-organism in a plasmid. 

5. A method according to the methods of claim 3 and 4 characterised in that said 
20 plasmid is pGFP+luc* (SEQ ID NO: 1). 
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SEQUENCE LISTING 

<110> LILIUS, Esa-Matti 

<120> A Method to Enable Assessment of Growth and Death of 
Micro- orgna i sms 

<130> pGFP+luc* -plasmid 

<140> 
<141> 

<160> 4 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 5051 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: pGFP+luc* 
plasmid 

<220> 

<221> promoter 
<222> (95) . . (199) 
<223> lac promoter 

<220> 
<221> CDS 

<222> (289) . . (1008) 
<223> GFP 

<220> 
<221> CDS 

<222> (1044) . . (2696) 
<223> firefly luciferase 

<220> 
<221> CDS 

<222> (3251) . . (4111) 
<223> beta-lactamase 

<400> 1 



agcgcccaat 


acgcaaaccg 


cctctccccg 


cgcgttggcc 


gattcattaa 


tgcagctggc 


60 


acgacaggtt 


tcccgactgg 


aaagcgggca 


gtgagcgcaa 


cgcaattaat 


gtgagttagc 


120 


tcactcatta 


ggcaccccag 


gctttacact 


ttatgcttcc 


ggctcgtatg 


ttgtgtggaa 


180 


ttgtgagcgg 


ataacaattt 


cacacaggaa 


acagctatga 


ccatgattac 


gecaagcttg 


240 


catgcctgca 


ggtcgactct 


agaggatccc 


cgggtaccgg 


tcgccacc atg gtg age 


297 



Met Val Ser 
1 

aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg gtc gag ctg 345 
Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val Glu Leu 
5 10 15 

gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc gag ggc gag 393 
Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu 
20 25 30 35 



40 
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ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate tgc acc acc 441 
Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys Thr Thr 
40 45 50 

ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc ctg acc tac 489 
Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr 
55 60 65 

ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag cag cac gac 53 7 
Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp Kis Met Lys Gin His Asp 
70 75 80 

ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag cgc acc ate 585 
Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg Thr lie 
85 90 95 

ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag gtg aag ttc 633 
Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe 
100 105 110 115 

gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc ate gac ttc 681 
Glu Gly Asp Thr Leu Val Asn Axy lie Glu Leu Lys Gly lie Asp Phe 
120 125 130 

aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac aac tac aac 729 
Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn 
135 140 145 

age cac aac gtc tat ate atg gcc gac aag cag aag aac ggc ate aag 777 
Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly lie Lys 
150 155 160 

gtg aac ttc aag ate cgc cac aac ate gag gac ggc age gtg cag etc 825 
Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val Gin Leu 
165 170 175 

gcc gac cac tac cag cag aac acc ccc ate ggc gac ggc ccc gtg ctg 873 
Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro Val Leu 
180 185 190 195 

ctg ccc gac aac cac tac ctg age acc cag tec gcc ctg age aaa gac 921 
Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser Lys Asp 
200 205 210 

ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc gtg acc gcc 969 
Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala 
215 220 225 

gcc ggg ate act etc ggc atg gac gag ctg tac aag taa agcggccgct 1018 
Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 

230 235 240 

ctagaactag tggatccccc gtacc atg gaa gac gcc aaa aac ata aag aaa 1070 

Met Glu Asp Ala Lys Asn He Lys Lys 
245 

ggc ccg gcg cca ttc tat ccg eta gag gat gga acc get gga gag caa 1118 
Gly Pro Ala Pro Phe Tyr Pro Leu Glu Asp Gly Thr Ala Gly Glu Gin 
250 255 260 265 

ctg cat aag get atg aag aga tac gcc ctg gtt cct gga aca att get 1166 
Leu His Lys Ala Met Lys Arg Tyr Ala Leu Val Pro Gly Thr He Ala 
270 275 280 
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ttt aca gat gca cat ate gag gtg aac ate acg tac gcg gaa tac ttc 1214 
Phe Thr Asp Ala His lie Glu Val Asn lie Thr Tyr Ala Glu Tyr Phe 
285 290 295 

gaa atg tec gtt egg ttg gca gaa get atg aaa cga tat ggg ctg aat 1262 
Glu Met' Ser Val Arg Leu Ala Glu Ala Met Lys Arg Tyr Gly Leu Asn 
300 305 310 

aca aat cac aga ate gtc gta tgc agt gaa aac tct ctt caa ttc ttt 1310 
Thr Asn His Arg lie Val Val Cys Ser Glu Asn Ser Leu Gin Phe Phe 
315 320 325 

atg ccg gtg ttg ggc gcg tta ttt ate gga gtt gca gtt gcg ccc gcg 1358 
Met Pro Val Leu Gly Ala Leu Phe lie Gly Val Ala Val Ala Pro Ala 
330 335 340 345 

aac gac att tat aat gaa cgt gaa ttg etc aac agt atg aac att teg 1406 
Asn Asp lie Tyr Asn Glu Arg Glu Leu Leu Asn Ser Met Asn lie Ser 
350 355 360 

cag cct acc gta gtg ttt gtt tec aaa aag ggg ttg caa aaa att ttg 1454 
Gin Pro Thr Val Val Phe Val ser Lys Lys Gly Leu Gin Lys lie Leu 
365 370 375 

aac gtg caa aaa aaa tta cca ata ate cag aaa att att ate atg gat 1502 
Asn Val Gin Lys Lys Leu Pro lie lie Gin Lys lie lie lie Met Asp 
380 385 390 

tct aaa acg gat tac cag gga ttt cag teg atg tac acg ttc gtc aca 1550 
Ser Lys Thr Asp Tyr Gin Gly Phe Gin Ser Met Tyr Thr Phe Val Thr 
395 400 405 

tct cat eta cct ccc ggt ttt aat gaa tac gat ttt gta cca gag tec 1598 
Ser His Leu Pro Pro Gly Phe Asn Glu Tyr Asp Phe Val Pro Glu Ser 
410 415 420 425 

ttt gat cgt gac aaa aca att gca ctg ata atg aac tec tct gga tct 1646 
Phe Asp Arg Asp Lys Thr lie Ala Leu lie Met Asn Ser Ser Gly Ser 
430 435 440 

act ggg tta cct aag ggt gtg gee ctt ccg cat aga act gec tgc gtc 1694 
Thr Gly Leu Pro Lys Gly Val Ala Leu Pro His Arg Thr Ala Cys Val 
445 450 455 

aga ttc teg cat gee aga gat cct att ttt ggc aat caa ate att ccg 1742 
Arg Phe Ser His Ala Arg Asp Pro lie Phe Gly Asn Gin lie He Pro 
460 465 470 

gat act gcg att tta agt gtt gtt cca ttc cat cac ggt ttt gga atg 179 0 
Asp Thr Ala He Leu Ser Val Val Pro Phe His His Gly Phe Gly Met 
475 480 485 

ttt act aca etc gga tat ttg ata tgt gga ttt cga gtc gtc tta atg 1838 
Phe Thr Thr Leu Gly Tyr Leu He Cys Gly Phe Arg Val Val Leu Met 
490 495 500 505 

tat aga ttt gaa gaa gag ctg ttt tta cga tec ctt cag gat tac aaa 1886 
Tyr Arg Phe Glu Glu Glu Leu Phe Leu Arg Ser Leu Gin Asp Tyr Lys 
510 515 520 

att caa agt gcg ttg eta gta cca acc eta ttt tea ttc ttc gec aaa 1934 
He Gin Ser Ala Leu Leu Val Pro Thr Leu Phe Ser Phe Phe Ala Lys 
525 530 535 
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age act ctg att gac aaa tac gat tta tct aat tta cac gaa att get 1982 
Ser Thr Leu lie Asp Lys Tyr Asd Leu Ser Asn Leu His Glu lie Ala 
540 545 550 

tct ggg ggc gca cct ctt teg aaa gaa gtc ggg gaa gcg gtt gca aaa 203 0 
Ser Gly Gly Ala Pro Leu Ser Lys Glu Val Gly Glu Ala Val Ala Lys 
555 560 565 

cgc ttc cat ctt cca ggg ata cga caa gga tat ggg etc act gag act 2078 
Arg Phe His Leu Pro Gly lie Arg Gin Gly Tyr Gly Leu Thr Glu Thr 
570 575 580 585 

aca tea get att ctg att aca ccc gag ggg gat gat aaa ccg ggc gcg 212 6 
Thr Ser Ala lie Leu lie Thr Pro Glu Gly Asp Asp Lys Pro Gly Ala 
590 595 600 

gtc ggt aaa gtt gtt cca ttt ttt gaa gcg aag gtt gtg gat ctg gat 2174 
Val Gly Lys Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp 
605 610 615 

ace ggg aaa acg ctg ggc gtt aat cag aga ggc gaa tta tgt gtc aga 2222 
Thr Gly Lys Thr Leu Gly Val Asn Gin Arg Gly Glu Leu Cys val Arg 
620 625 630 

gga cct atg att atg tec ggt tat gta aac aat ccg gaa gcg acc aac 2270 
Gly Pro Met lie Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn 
635 640 645 

gee ttg att gac aag gat gga tgg eta cat tct gga gac ata get tac 2318 
Ala Leu lie Asp Lys Asp Gly Trp Leu His Ser Gly Asp lie Ala Tyr 
650 655 660 665 

tgg gac gaa gac gaa cac ttc ttc ata gtt gac cgc ttg aag tct tta 2366 
Trp Asp Glu Asp Glu His Phe Phe lie Val Asp Arg Leu Lys Ser Leu 
670 675 680 

att aaa tac aaa gga tac cag gtg gec ccc get gaa ttg gag teg ata 2414 
He Lys Tyr Lys Gly Tyr Gin Val Ala Pro Ala Glu Leu Glu Ser He 
685 690 695 

ttg tta caa cac ccc aac ate ttc gac gcg ggc gtg gca ggt ctt ccc 2462 
Leu Leu Gin His Pro Asn He Phe Asp Ala Gly Val Ala Gly Leu Pro 
700 705 710 

gac gat gac gec ggt gaa ctt ccc gec gee gtt gtt gtt ttg gag cac 2510 
Asp Asp Asp Ala Gly Glu Leu Pro Ala Ala Val Val Val Leu Glu His 
715 720 725 

gga aag acg atg acg gaa aaa gag ate gtg gat tac gtc gec agt caa 2558 
Gly Lys Thr Met Thr Glu Lys Glu He Val Asp Tyr Val Ala Ser Gin 
730 735 740 745 

gta aca acc gec aaa aag ttg cgc gga gga gtt gtg ttt gtg gac gaa 2 606 
Val Thr Thr Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu 
750 755 760 

gta ccg aaa ggt ctt acc gga aaa etc gac gca aga aaa ate aga gag 2 654 
Val Pro Lys Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys He Arg Glu 
765 770 775 

ate etc ata aag gee aag aag ggc gga aag tec aaa ttg taa 2696 
He Leu He Lys Ala Lys Lys Gly Gly Lys Ser Lys Leu 
780 785 790 



WO 00/75367 



PCT/FI00/00507 



aatgtaactg 


tattcagcga 


tgacgaaatt 


cttagctatt 


gtaatactct 


aggggctgca 


2756 


ggaattcgat 


atcaagctta 


tcgataccgt 


cgacctcgag 


ggggggccct 


ttcgtctcgc 


2816 


gcgtttcggt 


gatgacggtg 


aaaacctctg 


acacatgcag 


ctcccggaga 


cggtcacagc 


2876 


ttgtctgtaa 


gcggatgccg 


ggagcagaca 


agcccgtcag 


ggcgcgtcag 


cgggtgttgg 


2936 


cgggtgtcgg 


ggctggctta 


actatgcggc 


atcagagcag 


attgtactga 


gagtgcacca 


2996 


tatgcggtgt 


gaaataccgc 


acagatgcgt 


aaggagaaaa 


taccgcatca 


ggcggcctta 


3056 


agggcctcgt 


gatacgccta 


tttttatagg 


ttaatgtcat 


gataataatg 


gtttcttaga 


3116 


cgtcaggtgg 


cacttttcgg 


ggaaatgtgc 


gcggaacccc 


tatttgttta 


tttttctaaa 


3176 


tacattcaaa 


tatgtatccg 


ctcatgagac 


aataaccctg 


ataaatgctt 


caataatatt 


3236 


gaaaaaggaa 


gagt atg agt att caa cat ttc cgt 


gtc gcc ctt 


att ccc 


3286 



Met Ser lie Gin His Phe Arg Val Ala Leu lie Pro 
795 800 

ttt ttt gcg gca ttt tgc ctt cct gtt ttt get cac cca gaa acg ctg 3334 
Phe Phe Ala Ala Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr Leu 
805 810 815 

gtg aaa gta aaa gat get gaa gat cag ttg ggt gca cga gtg ggt tac 33 82 
Val Lys Val Lys Asp Ala Glu Asp Gin Leu Gly Ala Arg Val Gly Tyr 
820 825 830 835 

ate gaa ctg gat etc aac age ggt aag ate ctt gag agt ttt cgc ccc 343 0 
lie Glu Leu Asp Leu Asn Ser Gly Lys lie Leu Glu Ser Phe Arg Pro 
840 845 850 

gaa gaa cgt ttt cca atg atg age act ttt aaa gtt ctg eta tgt ggc 3478 
Glu Glu Arg Phe Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys Gly 
855 860 865 

gcg gta tta tec cgt att gac gcc ggg caa gag caa etc ggt cgc cgc 3 526 
Ala Val Leu Ser Arg lie Aso Ala Gly Gin Glu Gin Leu Gly Arg Arg 
870 875 880 

ata cac tat tct cag aat gac ttg gtt gag tac tea cca gtc aca gaa 3574 
lie His Tyr Ser Gin Asn Asp Leu Val Glu Tyr Ser Pro Val Thr Glu 
885 890 895 

aag cat ctt acg gat ggc atg aca gta aga gaa tta tgc agt get gcc 3622 
Lys His Leu Thr Asp Gly Met Thr Val Arg Glu Leu Cys Ser Ala Ala 
900 905 910 915 

ata acc atg agt gat aac act gcg gcc aac tta ctt ctg aca acg ate 3670 
lie Thr Met Ser Asp Asn Thr Ala Ala Asn Leu Leu Leu Thr Thr lie 
920 925 930 

gga gga ccg aag gag eta acc get ttt ttg cac aac atg ggg gat cat 3718 
Gly Gly Pro Lys Glu Leu Thr Ala Phe Leu His Asn Met Gly Asp His 
935 940 945 

gta act cgc ctt gat cgt tgg gaa ccg gag ctg aat gaa gcc ata cca 3766 
Val Thr Arg Leu Asp Arg Trp Glu Pro Glu Leu Asn Glu Ala lie Pro 
950 955 960 

aac gac gag cgt gac acc acg atg cct gta gca atg gca aca acg ttg 3814 
Asn Asp Glu Arg Asp Thr Thr Met Pro Val Ala Met Ala Thr Thr Leu 
965 970 975 
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cgc aaa eta tta act ggc gaa eta ctt act eta get tec egg caa caa 3862 
Arg Lys Leu Leu Thr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gin Gin 
980 985 990 995 

tta ata gac tgg atg gag gcg gat aaa gtt gca gga cca ctt ctg cgc 3 910 
Leu lie Asp Trp Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu Arg 
1000 1005 1010 

teg gee ctt ccg get ggc tgg ttt att get gat aaa tct gga gee ggt 3958 
Ser Ala Leu Pro Ala Gly Trp Phe lie Ala Asd Lys Ser Gly Ala Gly 
1015 1020 1025 

gag cgt ggg tct cgc ggt ate att gca gca ctg ggg cca gat ggt aag 4006 
Glu Arg Gly Ser Arg Gly lie lie Ala Ala Leu Gly Pro Asp Gly Lys 
1030 1035 1040 

ccc tec cgt ate gta gtt ate tac acg acg ggg agt cag gca act atg 4054 
Pro Ser Arg He Val Val He Tyr Thr Thr Gly Ser Gin Ala Thr Met 
1045 1050 1055 

gat gaa cga aat aga cag ate get gag ata ggt gee tea ctg att aag 4102 
Asp Glu Arg Asn Arg Gin He Ala Glu He Gly Ala Ser Leu He Lys 
1060 1065 1070 1075 



cat tgg taa ctgtcagacc aagtttactc atatatactt tagattgatt 4151 
His Trp 



taaaacttca 


tttttaattt 


aaaaggatct 


aggtgaagat 


cctttttgat 


aatctcatga 


4211 


ccaaaatccc 


ttaacgtgag 


ttttcgttcc 


actgagegtc 


agaccccgta 


gaaaagatca 


4271 


aaggatcttc 


ttgagatcct 


ttttttctgc 


gcgtaatctg 


ctgcttgcaa 


acaaaaaaac 


4331 


caccgctacc 


agcggtggtt 


tgtttgccgg 


atcaagagct 


accaactctt 


tttccgaagg 


4391 


taactggctt 


cagcagagcg 


cagataccaa 


atactgtcct 


tctagtgtag 


ccgtagttag 


4451 


gccaccactt 


caagaactct 


gtagcaccgc 


ctacatacct 


cgctctgcta 


atcctgttac 


4511 


cagtggctgc 


tgccagtggc 


gataagtcgt 


gtcttacegg 


gttggactca 


agacgatagt 


4571 


taceggataa 


ggegcagegg 


tegggctgaa 


eggggggtte 


gtgcacacag 


cccagcttgg 


4631 


agegaacgae 


ctacaccgaa 


ctgagatacc 


tacagcgtga 


gctatgagaa 


agcgccacgc 


4691 


ttcccgaagg 


gagaaaggcg 


gacaggtatc 


eggtaagegg 


cagggtegga 


acaggagagc 


4751 


gcacgaggga 


gcttccaggg 


ggaaacgect 


ggtatcttta 


tagtcctgtc 


gggtttcgcc 


4811 


acctctgact 


tgagegtega 


tttttgtgat 


getegtcagg 


ggggeggage 


ctatggaaaa 


4871 


acgccagcaa 


cgcggccttt 


ttacggttcc 


tggccttttg 


ctggcctttt 


gctcacatgt 


4931 


tctttcctgc 


gttatcccct 


gattctgtgg 


ataacegtat 


taccgccttt 


gag t gage tg 


4991 


ataccgctcg 


ccgcagccga 


acgaccgagc 


geagegagtc 


agtgagcgag 


gaagcggaag 


5051 



<210> 2 
<211> 239 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: pGFP+luc* 
plasmid 



4& 
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<400> 2 






Met 


Val 


Ser 


Lys 


1 








Val 


Glu 


Leu 


Asp 








20 


Glu 


Gly 


Glu 


Gly 






35 




Cys 


Thr 


Thr 


Gly 




50 






Leu 


Thr 


Tyr 


Gly 


65 








Gin 


His 


Asp 


Phe 


Arg 


Thr 


He 


Phe 








100 


Val 


Lys 


Phe 


Glu 






115 




He 


Asp 


Phe 


Lys 




130 






Asn 


Tyr 


Asn 


Ser 


145 








Gly 


He 


Lys 


Val 


Val 


Gin 


Leu 


Ala 








180 


Pro 


Val 


Leu 


Leu 






195 




Ser 


Lys 


Asp 


Pro 




210 






Val 


Thr 


Ala 


Ala 



225 



Gly Glu Glu Leu 
5 

Gly Asp Val Asn 

Asp Ala Thr Tyr 
40 

Lys Leu Pro Val 
55 

Val Gin Cys Phe 
70 

Phe Lys Ser Ala 
85 

Phe Lys Asp Asp 

Gly Asp Thr Leu 
120 

Glu Asp Gly Asn 
135 

His Asn Val Tyr 
150 

Asn Phe Lys He 
165 

Asp His Tyr Gin 

Pro Asp Asn His 
200 

Asn Glu Lys Arg 
215 

Gly He Thr Leu 
230 



Phe 


Thr 


Gly 


Val 




10 






Gly 


His 


Lys 


Phe 


25 








Glv 


Lys 


Leu 


Thr 


Pro 


Trp 


Pro 


Thr 








60 


Ser 


Arg 


Tyr 


Pro 






75 




Met 


Pro 


Glu 


Gly 




90 






Gly 


Asn 


Tyr 


Lys 


105 








Val 


Asn 


Arcr 


He 


He 


Leu 


Gly 


His 








140 


He 


Met 


Ala 


Asp 






155 




Arg 


His 


Asn 


He 




170 






Gin 


Asn 


Thr 


Pro 


185 








Tyr 


Leu 


Ser 


Thr 


Asp 


His 


Met 


Val 








220 


Gly 


Met 


Asp 


Glu 






235 





Val 


Pro 


He 


Leu 






15 




Ser 


Val 


Ser 


Gly 




30 






Leu 


Lys 


Phe 


He 


45 








Leu 


Val 


Thr 


Thr 


Asp 


His 


Met 


Lys 








80 


Tyr 


Val 


Gin 


Glu 






95 




Thr 


Arg 


Ala 


Glu 




110 






Glu 


Leu 


Lys 


Gly 


125 








Lys 


Leu 


Glu 


Tvr 

JT 


Lys 


Gin 


Lys 


Asn 








160 


Glu 


Asp 


Gly 


Ser 






175 




He 


Gly 


Asp 


Gly 




190 






Gin 


Ser 


Ala 


Leu 


205 








Leu 


Leu 


Glu 


Phe 


Leu 


Tyr 


Lys 





<210> 3 
<211> 550 
<212> PRT 

<213> Artificial Sequence 
<223> Description of Artificial 
plasmid 

<400> 3 



Met 


Glu 


Asp 


Ala 


Lys 


Asn 


He 


Lys 


1 








5 








Leu 


Glu 


Asp 


Gly 


Thr 


Ala 


Gly 


Glu 








20 










Tyr 


Ala 


Leu 


Val 


Pro 


Gly 


Thr 


He 






35 










40 


Val 


Asn 


He 


Thr 


Tyr 


Ala 


Glu 


Tyr 




50 










55 




Glu 


Ala 


Met 


Lys 


Arg 


Tyr 


Gly 


Leu 


65 










70 






Cys 


Ser 


Glu 


Asn 


Ser 


Leu 


Gin 


Phe 










85 








Phe 


He 


Gly 


Val 


Ala 


Val 


Ala 


Pro 








100 










Glu 


Leu 


Leu 


Asn 


Ser 


Met 


Asn 


He 






115 










120 


Ser 


Lys 


Lys 


Gly 


Leu 


Gin 


Lys 


He 




130 










135 




He 


He 


Gin 


Lys 


He 


He 


He 


Met 


145 










150 






Phe 


Gin 


Ser 


Met 


Tyr 


Thr 


Phe 


Val 










165 








Asn 


Glu 


Tyr 


Asp 


Phe 


Val 


Pro 


Glu 



180 



Sequence : pGFP+luc* 



Lys 


Gly 


Pro 


Ala 


Pro 


Phe 


Tyr 


Pro 




10 










15 




Gin 


Leu 


His 


Lys 


Ala 


Met 


Lys 


Arg 


25 










30 






Ala 


Phe 


Thr 


Asp 


Ala 


His 


lie 


Glu 










45 








Phe 


Glu 


Met 


Ser 


Val 


Arg 


Leu 


Ala 








60 










Asn 


Thr 


Asn 


His 


Arg 


He 


Val 


Val 






75 










80 


Phe 


Met 


Pro 


Val 


Leu 


Gly 


Ala 


Leu 




90 










95 




Ala 


Asn 


Asp 


He 


Tyr 


Asn 


Glu 


Arg 


105 










110 






Ser 


Gin 


Pro 


Thr 


Val 


Val 


Phe 


Val 










125 








Leu 


Asn 


Val 


Gin 


Lys 


Lys 


Leu 


Pro 








140 










Asp 


Ser 


Lys 


Thr 


Asp 


Tyr 


Gin 


Gly 






155 










160 


Thr 


Ser 


His 


Leu 


Pro 


Pro 


Gly 


Phe 




170 










175 




Ser 


Phe 


Asp 


Arg 


Asp 


Lys 


Thr 


He 


185 










190 
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Ala Leu lie Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val 

195 200 205 

Ala Leu Pro His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp 

210 215 220 

Pro lie Phe Gly Asn Gin lie He Pro Asp Thr Ala He Leu Ser Val 
225 230 235 240 

Val Pro Phe His His Gly Phe Gly Met Phe Thr Thr Leu Gly Tyr Leu 

245 250 255 

He Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe Glu Glu Glu Leu 

260 265 270 

Phe Leu Arg Ser Leu Gin Asp Tyr Lys He Gin Ser Ala Leu Leu Val 

275 280 285 

Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Leu He Asp Lys Tyr 

290 295 300 

Asp Leu Ser Asn Leu His Glu He Ala Ser Gly Gly Ala Pro Leu Ser 
305 310 315 320 

Lys Glu Val Gly Glu Ala Val Ala Lys Arg Phe His Leu Pro Gly He 

325 330 335 

Arg Gin Gly Tyr Gly Leu Thr Glu Thr Thr Ser Ala He Leu He Thr 

340 345 350 

Pro Glu Gly Asp Asp Lys Pro Gly Ala Val Gly Lys Val Val Pro Phe 

355 360 365 

Phe Glu Ala Lys Val Val Asp Leu Asp Thr Gly Lys Thr Leu Gly Val 

370 375 380 

Asn Gin Arg Gly Glu Leu Cys Val Arg Gly Pro Met He Met Ser Gly 
385 390 395 400 

Tyr Val Asn Asn Pro Glu Ala Thr Asn Ala Leu He Asp Lys Asp Gly 

405 410 415 

Trp Leu His Ser Gly Asp He Ala Tyr Trp Asp Glu Asp Glu His Phe 

420 425 430 

Phe He Val Asp Arg Leu Lys Ser Leu He Lys Tyr Lys Gly Tyr Gin 

435 440 445 

Val Ala Pro Ala Glu Leu Glu Ser He Leu Leu Gin His Pro Asn He 

450 455 460 

Phe Asp Ala Gly Val Ala Gly Leu Pro Asp Asp Asp Ala Gly Glu Leu 
465 470 475 480 

Pro Ala Ala Val Val Val Leu Glu His Gly Lys Thr Met Thr Glu Lys 

485 490 495 

Glu He Val Asp Tyr Val Ala Ser Gin Val Thr Thr Ala Lys Lys Leu 

500 505 510 

Arg Gly Gly Val Val Phe Val Asp Glu Val Pro Lys Gly Leu Thr Gly 

515 520 525 

Lys Leu Asp Ala Arg Lys He Arg Glu He Leu He Lys Ala Lys Lys 

530 535 540 

Gly Gly Lys Ser Lys Leu 
545 550 



<210> 4 
<211> 286 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: pGFP+luc* 
plasmid 

<400> 4 

Met Ser He Gin His Phe Arg Val Ala Leu He Pro Phe Phe Ala Ala 

15 10 15 

Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr Leu Val Lys Val Lys 

20 25 30 

Asp Ala Glu Asp Gin Leu Gly Ala Arg Val Gly Tyr He Glu Leu Asp 

35 40 45 

Leu Asn Ser Gly Lys He Leu Glu Ser Phe Arg Pro Glu Glu Arg Phe 

50 55 60 

Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys Gly Ala Val Leu Ser 
65 70 75 80 



4% 
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Arg 


He 


Asp 


Ala 


Gly 


Gin 


Glu 


Gin 


Leu 


Gly 


Arg 


Arg 


He 


His 


Tyr 


Ser 










85 










90 










95 




Gin 


Asn 


Asp 


Leu 


Val 


Glu 


Tyr 


Ser 


Pro 


Val 


Thr 


Glu 


Lys 


His 


Leu 


Thr 








100 










105 










110 






Asp 


Gly 


Met 


Thr 


Val 


Arg 


Glu 


Leu 


Cys 


Ser 


Ala 


Ala 


He 


Thr 


Met 


Ser 






115 










120 










125 








Asp 


Asn 


Thr 


Ala 


Ala 


Asn 


Leu 


Leu 


Leu 


Thr 


Thr 


He 


Gly 


Gly 


Pro 


Lys 




130 










135 










140 










Glu 


Leu 


Thr 


Ala 


Phe 


Leu 


His 


Asn 


Met 


Gly 


Asp 


His 


Val 


Thr 


Arg 


Leu 


145 










150 










155 










160 


Asp 


Arg 


Trp 


Glu 


Pro 


Glu 


Leu 


Asn 


Glu 


Ala 


He 


Pro 


Asn 


Asp 


Glu 


Arg 










165 










170 










175 




Asp 


Thr 


Thr 


Met 


Pro 


Val 


Ala 


Met 


Ala 


Thr 


Thr 


Leu 


Arg 


Lys 


Leu 


Leu 








180 










185 










190 






Thr 


Gly 


Glu 


Leu 


Leu 


Thr 


Leu 


Ala 


Ser 


Arg 


Gin 


Gin 


Leu 


He 


Asp 


Trp 






195 










200 










205 








Met 


Glu 


Ala 


Asp 


Lys 


Val 


Ala 


Gly 


Pro 


Leu 


Leu 


Arg 


Ser 


Ala 


Leu 


Pro 




210 










215 










220 










Ala 


Gly 


Trp 


Phe 


He 


Ala 


Asp 


Lys 


Ser 


Gly 


Ala 


Gly 


Glu 


Arg 


Gly 


Ser 


225 










230 










235 










240 


Arg 


Gly 


He 


He 


Ala 


Ala 


Leu 


Gly 


Pro 


Asp 


Gly 


Lys 


Pro 


Ser 


Arg 


He 










245 










250 










255 




Val 


Val 


He 


Tyr 


Thr 


Thr 


Gly 


Ser 


Gin 


Ala 


Thr 


Met 


Asp 


Glu 


Arg 


Asn 








260 










265 










270 






Arg 


Gin 


He 


Ala 


Glu 


He 


Gly 


Ala 


Ser 


Leu 


He 


Lys 


His 


Trp 







275 280 285 
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INTERNATIONAL PRE LI 



ARY EXAMINATION REPORT 



[ional application No. 
PCT/FI00/00507 



I. Busis of the report 



1 With regard to the elements of the international application:* 
[Xj the international application as originally filed 

| | the description: 



pages 
pages 
pages 



, as originallv filed 



, filed with the demand 



, filed with the letter of 



| | the claims: 
pages 
pages 
pages 
pages 



, as originally filed 

, as amended (together with any statement) under article 19 

, filed with the demand 



filed with the letter of 



| | the drawings: 
pages 

pages 



, as originally filed 



, filed with the demand 



filed with the letter of 



1 1 the sequence listing part of the description: 
pages 

pages 

pages 



, as originally filed 



filed with the demand 



, filed with the letter of 



With regard to the language, all the elements marked above were available or furnished to this Authority in the language in which 
the international application was filed, unless otherwise indicated under this item. 

Those elements were available or furnished to this Authority in the following language which is: 



| | the language of a translation furnished for the purposes of international search (under Rule 23.1(b)). 
| | the language of publication of the international application (under Rule 48.3(b)). 

| I the language of the translation furnished for the purposes of international preliminary examination (under Rules 55 2 and/ 
1 1 or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international 
preliminary examination was carried out on the basis of the sequence listing: 

| | contained in the international application in written form. 

| | filed together with the international application in computer readable form. 

| | furnished subsequently to this Authority in written form. 

| | furnished subsequently to this Authority in computer readable form. 

I 1 1 he statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
1 international application as filed lias been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence listing has 
been furnished. 

4. 1 | The amendments have resulted in the cancellation of: 

| | the description, pages 

□ 



the claims, Nos. 



| | the drawings, sheet/ fig 



s I I 1 nis rc ^ n hils bccn established as if (some of) the amendments had not been made, since they have been considerai to go 
' 1 ' beyond the disclosure as filed, as indicated in the Supplemental Box (Rule 70.2 (c)).** 

* Replacement sheets which have been furnished to the receiving Office in response to an invitation under Article 14 are referred to 
in this report as "originally filed " and are annexed to this report since they do not contain amendments (Rules 70 16 

and 70.17). 

* Any replacement sheet containing such amendments must be referred to under item I and annexed to this report. 
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V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



Statement 



Novelty (N) Claims 

C'laims 



1-5 YHS 

NO 



Inventive step (IS) Claims ^.5 YES 

Claims NO 

Industrial applicability (IA) Claims ]_ _ 5 y^s; 

Claims 



Citations and explanations (Rule 70.7) 

The present application pertains to a method to enable 
assessment of the growth rate and death rate of a 
microorganism within a chosen time period. The method involves 
introducing into said microorganism at least two reporter 
genes encoding luminescent, e.g. luciferase, and/or 
fluorescent products, e.g. green fluorescent protein (GFP) . At 
least two of the 'following products are produced: 

i) a stable product produced in a known proportion to the 
total amount of cells that are or have been alive, 

ii) a product present in a known proportion to the amount of 
cells alive, and 

iii) a stable product produced in a known proportion to the 
total amount of cells that have died. 

The international search report revealed five documents of 
importance : 

Dl) File WPI, Derwent accession no. 1999-379000, 
ZH TETSUDO SOGO GIJUTSU KENKYUSHO: "Rapid 
identification of microorganism cell for 
measuring sterilization effect etc - involves 
using fluorescent pigments wich differ in their 
wavelengths for identifying dead and living cells"; 
& JP, A, 1114 6798, 19990602, DW199932 

D2) US 5164301 A (THOMAS E. THOMPSON ET AL) , 17 November 1992 
(17.11.92), column 4, lines 34-39, 42-56 



.../... 
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Supplemental Box 

(To be used when the space in any of the preceding boxes is not sufficient) 



Continuation of: BOX V 

D3) Dialog Information services, File 5, BIOSIS, 
Dialog accession no. 11223430, BIOSIS no. 
199800004762, Fratamico Pina M. et al : 
"Construction and characterization of 
Escherichia coli 0157 :H7 strains expressing 
firefly luciferase and green fluorescent 
protein ant their use in survival studies"; 
& Journal of Food Protection 60 (10) :pll67- 
1173 Oct . , 1997 

D4) WO 9814605 Al (LOMA LINDA UNIVERSITY), 9 April 1998 
(09.04.98), page 2, lines 4-7, 15-16 

D5) US 5824468 A (SIEGFRIED SCHERER ET AL) . 20 October 1998 
(20.10.98), column 3, line 8-line 9, claim 5 

Dl discloses the identification of dead and living cells using 
fluorescent pigments in order to measure e.g., the 
sterilization effect by pharmaceuticals for food. 

D2 concerns a process for detecting microbial metabolism. The 
process involves a two dye fluorescence emission system which 
detects microbial growth (refer to column 4, lines 34-39) . 

D3 discloses the construction and characterization of 
Escherichia coli 0157 :H7 strains expressing firefly luciferase 
and GFP. The strains may be used to monitor bacterial survival 
in foods and in a food processing environment. 

D4 pertains to a luciferase and green fluorescent protein 
fusion gene. The fusion gene is useful as a double marker for 
monitoring gene expression in living cells and enzymatic 
activity (refer to page 2, lines 15-16) . 

From Dl and D2 it is considered to be known to measure 
microbial growth by using fluorescence dyes to differentiate 
between living and dead cells. The methods in Dl and D2 do not 
involve the use of two different reporter genes within one 
microorganism. Further, it is known from D3 and D4 to use two 
different markers, luciferase and GFP, within the same 
microorganism. However, neither D3 nor D4 discloses that GFP 
can be used as a measure of cells that have been or are alive, 
thus making it possible in combination with luciferase to 
discriminate between living and dead cells. 

.../... 
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Supplement^] Box 

(To be used when the space in any of the preceding boxes is not sufficient) 



Continuation of: Box V 



Thus, it is not considered obvious to a person skilled in the 
art to use a lucif erase-and GFP fusion gene 'as a double marker 
to differentiate between living and dead cells in order to 
measure the growth rate and death rate of- a microorganism. 
Therefore, claims 1-5 are considered to be novel, to involve 
an inventive step and to have industrial applicability* 

D5 relates to a detection - procedure for bacteria of the genus 
Listeria. The procedure involves the use of a marker gene 
(refer to claim 5) . D5 is considered to show the general state 
of the art and is not considered to be of particular 
relevance. 
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Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 
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